CLAIMS 

What is claimed is: 

1 1. An integrated voice activation detector for 

2 detecting whether voice is present, the integrated voice 

3 activation detector comprising: 

O 4 a semiconductor integrated circuit including, 

iD 5 at least one signal processing unit to perform 

w 

65 6 voice detection; and 

5 7 a processor readable storage means to store 

8 signal processing instructions for execution by the 

W 

S3 

9 at least one signal processing unit to: 

w 

P 10 detect whether noise is present to 

11 determine whether a noise flag should be set; 

12 detect a predetermined number of zero 

13 crossings to determine whether a zero crossing 

14 flag should be set; 

15 detect whether a threshold amount of 

16 energy is present to determine whether an 

17 energy flag should be set; 

18 detect whether instantaneous energy is 
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present to determine whether a instantaneous 
energy flag should be set; and 

utilize a combination of the noise, 
zero crossing, energy, and instantaneous energy 
flags to determine whether voice is present . 

2 . The integrated voice activation detector of 
claim 1, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform fast Fourier transformation 
(FFT) processing to determine whether a FFT flag should 
be set . 

3. The integrated voice activation detector of 
claim 1, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform an interim voice activity 
decision, a interim voice activity decision flag being 
set to indicate voice has been detected by determining if 
the instantaneous energy flag is set or the energy flag 
is set and the noise flag is not set and the zero 
crossing flag is not set. 
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4 . The integrated voice activation detector of 
claim 3, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform HangOver and Speech Kick in 
processing after the interim voice activity decision has 
been made to determine whether a voice activity flag 
should be set or cleared. 

5. The integrated voice activation detector of 
claim 4, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, if the voice activity flag is set, 
send a speech payload to be packet i zed and update the 
voice activity detection flag for external interaction 
with other functions of the semiconductor integrated 
circuit . 

6. The integrated voice activation detector of 
claim 4, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, if the voice activity flag is not 
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* 



5 set, disable an automatic level control and cause a 

6 silence insertion description payload to be prepared. 

1 7. The integrated voice activation detector of 

2 claim 1, wherein detecting a predetermined number of zero 

3 crossings to determine whether a zero crossing flag 

p 4 should be set includes determining whether a root mean 

5 square crossing value is greater than a threshold value. 

w 

P 1 8 . The integrated voice activation detector of 

/ 2 claim 1, wherein detecting whether noise is present to 

Q 

B(J 3 determine whether a noise flag should be set includes 

m 

Ly 4 determining whether energy in a current frame multiplied 

□ 

M 5 5 by a threshold is greater than delayed frame energy. 

1 9. The integrated voice activation detector of 

2 claim 1, wherein detecting whether a threshold amount of 

3 energy is present to determine whether an energy flag 

4 should be set includes determining if a logarithm of an 

5 autocorrelation of a frame is greater than an energy 

6 threshold. 
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1 10 . The integrated voice activation detector of 

2 claim 1, wherein detecting whether instantaneous energy 

3 is present to determine whether an instantaneous energy 

4 flag should be set includes determining whether a 

5 difference between a current frames energy at an 

6 autocorrelation of a tenth delayed sample and a prior 

7 frames energy at an autocorrelation of a tenth delayed 

8 sample is greater than a previous frames autocorrelation 
jp 9 multiplied by a threshold. 

P 

gj 1 11. A method for voice activation detection to 

ru 

y 2 detect whether voice is present, the method comprising: 

P 

U 3 detecting whether noise is present to 

4 determine whether a noise flag should be set; 

5 detecting a predetermined number of 

6 zero crossings to determine whether a zero 

7 crossing flag should be set; 

8 detecting whether a threshold amount of 

9 energy is present to determine whether an 

10 energy flag should be set; 

11 detecting whether instantaneous energy 
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12 is present to determine whether a instantaneous 

13 energy flag should be set; and 

14 utilizing a combination of the noise, 

15 zero crossing, energy, and instantaneous energy 

16 flags to determine whether voice is present. 

1 12. The method of claim 11, further comprising, 

^ 2 performing fast Fourier transformation (FFT) processing 

y3 

Jff 3 to determine whether a FFT flag should be set. 

w 

S 

1 13. The method of claim 11, further comprising, 

J; 2 performing an interim voice activity decision, a interim 

3 voice activity decision flag being set to indicate that 

O 

4 voice has been detected by determining if the 

5 instantaneous energy flag is set or the energy flag is 

6 set and the noise flag is not set and the zero crossing 

7 flag is not set. 

1 14. The method of claim 13, further comprising, 

2 performing HangOver and Speech Kick in processing after 

3 the interim voice activity decision has been made to 

4 determine whether a voice activity flag should be set or 
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cleared . 

15. The method of claim 14, further comprising, if 
the voice activity flag is set, sending a speech payload 
to be packetized and updating the voice activity 
detection flag for external interaction with other 
functions. 

16. The method of claim 14, further comprising, if 
the voice activity flag is not set, disabling an 
automatic level control and causing a silence insertion 
description payload to be prepared. 

17. The method of claim 11, wherein detecting a 
predetermined number of zero crossings to determine 
whether a zero crossing flag should be set includes 
determining whether a root mean square crossing value is 
greater than a threshold value. 

18. The method of claim 11, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
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a current frame multiplied by a threshold is greater than 
delayed frame energy. 

19. The method of claim 11, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
determining if a logarithm of an autocorrelation of a 
frame is greater than an energy threshold. 

20. The method of claim 11, wherein detecting 
whether instantaneous energy is present to determine 
whether an instantaneous energy flag should be set 
includes determining whether a difference between a 
current frames energy at an autocorrelation of a tenth 
delayed sample and a prior frames energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frames autocorrelation multiplied by a 
threshold . 

21. An apparatus comprising: 

at least one signal processing unit to perform 
voice detection; and 
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4 a storage device to store signal processing 

5 instructions for execution by the at least one 

6 signal processing unit to: 

7 determine whether a noise flag, a zero 

8 crossing flag, an energy flag, and an 

9 instantaneous energy flag should be set; and 

^ 10 utilize a combination of the noise, 

Li 

11 zero crossing, energy, and instantaneous energy 

(jJ 

gQ 12 flags to determine whether voice is present. 
h& 

O 

2" 

a 1 2 2.. The apparatus of claim 21, wherein the signal 
gj 2 processing instructions further for execution by the at 

§y 

hj 3 least one signal processing unit to: 

3 

IM* 4 detect whether noise is present to 

5 determine whether the noise flag should be set; 

6 detect a predetermined number of zero 

7 crossings to determine whether the zero 

8 crossing flag should be set; 

9 detect whether a threshold amount of 

10 energy is present to determine whether the 

11 energy flag should be set; and 

12 detect whether instantaneous energy is 
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present to determine whether the instantaneous 
energy flag should be set. 

23. The apparatus of claim 21, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform fast Fourier 
transformation (FFT) processing to determine whether a 
FFT flag should be set. 

24. The apparatus of claim 21, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform an interim 
voice activity decision, a interim voice activity 
decision flag being set to indicate voice has been 
detected by determining if the instantaneous energy flag 
is set or the energy flag is set and the noise flag is 
not set and the zero crossing flag is not set . 

25. The apparatus of claim 24, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform HangOver and 
Speech Kick in processing after the interim voice 
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5 activity decision has been made to determine whether a 

6 voice activity flag should be set or cleared. 

1 26. The apparatus of claim 25, wherein the signal 

2 processing instructions further for execution by the at 

3 least one signal processing unit to, if the voice 

~» 4 activity flag is set, send a speech payload to be 

u 

yp 5 packetized and update the voice activity detection flag 

6 for external interaction with other functions of the 

H 

O 7 semiconductor integrated circuit . 

P 

JO 1 27. The apparatus of claim 25, wherein the signal 

tij 2 processing instructions further for execution by the at 

S3 

H 3 least one signal processing unit to, if the voice 

4 activity flag is not set, disable an automatic level 

5 control and cause a silence insertion description payload 

6 to be prepared. 

1 28. The apparatus of claim 22, wherein detecting a 

2 predetermined number of zero crossings to determine 

3 whether a zero crossing flag should be set includes 

4 determining whether a root mean square crossing value is 
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greater than a threshold value. 

29. The apparatus of claim 22, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
a current frame multiplied by a threshold is greater than 
delayed frame energy. 

30. The apparatus of claim 22, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
determining if a logarithm of an autocorrelation of a 
frame is greater than an energy threshold. 

31. The apparatus of claim 22, wherein detecting 
whether instantaneous energy is present to determine 
whether an instantaneous energy flag should be set 
includes determining whether a difference between a 
current frames energy at an autocorrelation of a tenth 
delayed sample and a prior frames energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frames autocorrelation multiplied by a 
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9 threshold. 

1 32 , A method comprising: 

2 determining whether a noise flag, a zero 

3 crossing flag, an energy flag, and an 

4 instantaneous energy flag should be set; and 

5 utilizing a combination of the noise, zero 

6 crossing, energy, and instantaneous energy 

7 flags to determine whether voice is present. 

1 33. The method of claim 32, further comprising: 

2 detecting whether noise is present to 

3 determine whether the noise flag should be set; 

4 detecting a predetermined number of zero 

5 crossings to determine whether the zero 

6 crossing flag should be set; 

7 detecting whether a threshold amount of 

8 energy is present to determine whether the 

9 energy flag should be set; and 

10 detecting whether instantaneous energy is 

11 present to determine whether the instantaneous 

12 energy flag should be set. 
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34. The method of claim 33, further comprising, 
performing fast Fourier transformation (FFT) processing 
to determine whether a FFT flag should be set.- 

35. The method of claim 32, further comprising, 
performing an interim voice activity decision, a interim 
voice activity decision flag being set to indicate that 
voice has been detected by determining if the 
instantaneous energy flag is set or the energy flag is 
set and the noise flag is not set and the zero crossing 
flag is not set. 

36. The method of claim 35, further comprising, 
performing HangOver and Speech Kick in processing after 
the interim voice activity decision has been made to 
determine whether a voice activity flag should be set or 
cleared . 

37. The method of claim 36, further comprising, if 
the voice activity flag is set, sending a speech payload 
to be packetized and updating the voice activity 
detection flag for external interaction with other 
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functions . 

38. The method of claim 36, further comprising, if 
the voice activity flag is not set, disabling an 
automatic level control and causing a silence insertion 
description payload to be prepared. 

39. The method of claim 33, wherein detecting a 
predetermined number of zero crossings to determine 
whether a zero crossing flag should be set includes 
determining whether a root mean square crossing value is 
greater than a threshold value. 

40. The method of claim 33, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
a current frame multiplied by a threshold is greater than 
de 1 ay ed f r ame ene rgy . 

41. The method of claim 33, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
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4 determining if a logarithm of an autocorrelation of a 

5 frame is greater than an energy threshold. 

1 42. The method of claim 33, wherein detecting 

2 whether instantaneous energy is present to determine 

3 whether an instantaneous energy flag should be set 

_ 4 includes determining whether a difference between a 

O 
tO 

^ 5 current frames enerqy at an autocorrelation of a tenth 

MS 

gp 6 delayed sample and a prior frames energy at an 

p 7 autocorrelation of a tenth delayed sample is greater than 

^ 8a previous frames autocorrelation multiplied by a 



CQ 9 threshold. 

ru 

iu 

m 1 43 . A machine -readable medium having stored thereon 

2 instructions, which when executed by a machine, causes 

3 the machine to perform operations comprising: 

4 determining whether a noise flag, a zero 

5 crossing flag, an energy flag, and an 

6 instantaneous energy flag should be set; and 

7 utilizing a combination of the noise, zero 

8 crossing, energy, and instantaneous energy 

9 flags to determine whether voice is present. 
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44. The machine- readable medium of claim 43, furthe 
comprising : 

detecting whether noise is present to 
determine whether the noise flag should be set 

detecting a predetermined number of zero 
crossings to determine whether the zero 
crossing flag should be set; 

detecting whether a threshold amount of 
energy is present to determine whether the 
energy flag should be set; and 

detecting whether instantaneous energy is 
present to determine whether the instantaneous 
energy flag should be set. 

45. The machine-readable medium of claim 43, furthe 
comprising, performing fast Fourier transformation (FFT) 
processing to determine whether a FFT flag should be set 

46. The machine-readable medium of claim 43, furthe 
comprising, performing an interim voice activity 
decision, a interim voice activity decision flag being 
set to indicate that voice has been detected by 
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determining if the instantaneous energy flag is set or 
the energy flag is set and the noise flag is not set and 
the zero crossing flag is not set. 

47. The machine-readable medium of claim 46, further 
comprising, performing HangOver and Speech Kick in 
processing after the interim voice activity decision has 
been made to determine whether a voice activity flag 
should be set or cleared. 

48. The machine -readable medium of claim 47, further 
comprising, if the voice activity flag is set, sending a 
speech payload to be packetized and updating the voice 
activity detection flag for external interaction with 
other functions. 

49. The machine -readable medium of claim 47, further 
comprising, if the voice activity flag is not set, 
disabling an automatic level control and causing a 
silence insertion description payload to be prepared. 

50. The machine- readable medium of claim 44, wherein 
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detecting a predetermined number of zero crossings to 
determine whether a zero crossing flag should be set 
includes determining whether a root mean square crossing 
value is greater than a threshold value. 

51. The machine- readable medium of claim 44, wherein 
detecting whether noise is present to determine whether a 
noise flag should be set includes determining whether 
energy in a current frame multiplied by a threshold is 
greater than delayed frame energy. 

52. The machine- readable medium of claim 44, wherein 
detecting whether a threshold amount of energy is present 
to determine whether an energy flag should be set 
includes determining if a logarithm of an autocorrelation 
of a frame is greater than an energy threshold. 

53. The machine-readable medium of claim 44, wherein 
detecting whether instantaneous energy is present to 
determine whether an instantaneous energy flag should be 
set includes determining whether a difference between a 
current frames energy at an autocorrelation of a tenth 
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s : 

lew 
2 



ni 

w 



6 delayed sample and a prior frames energy at an 

7 autocorrelation of a tenth delayed sample is greater than 

8 a previous frames autocorrelation multiplied by a 

9 threshold. 
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